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Abstract 

Let Gs-out denote the random graph on vertex set [n] in which each vertex 
chooses 3 neighbors uniformly at random. Note that Ga-out has minimum 
degree 3 and average degree 6. We prove that the probabihty that Gs-out is 
Hamiltonian goes to 1 as n tends to infinity. 



1 Introduction 

One of the natural questions for a sparse random graph model is whether or not 
a random instance is Hamiltonian whp. The Hamiltonicity threshold for the basic 
models Gn,m and Gn,p was established quite precisely by Komlos and Szemeredi [14]. 
See BoUobas [3], Ajtai, Komlos and Szemeredi [1] and Bollobas and Frieze [4] for 
refinements. If m = |(logn + log log n + c) then 

lim Pr{Gn,m is Hamiltonian) = lim Pr{5{Gn,m) > 2) = e"*^ 

where S denotes minimum degree. The lesson here is that the minimum degree seems 
to be the most important factor in determining the likelihood of Hamiltonicity in 
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a random graph. This naturally leads to the consideration of models where the 
minimum degree condition is automatically satisfied. If we simply condition on Gn,m 
having minimum degree 2 then about |n log n random edges are needed (see BoUobas, 
Fenner and Frieze [7]). The barrier to Hamiltonicity in this model is the existence of 
three vertices of degree 2 that have a common neighbor. So, we expect even sparser 
random graph models with minimum degree at least three to have Hamilton cycles 
whp (note that if we have a linear number of vertices of degree 2 it may be very likely 
that there are three vertices of degree 2 that have a common neighbor). We focus our 
attention here on random graphs with only a linear number of edges and minimum 
degree 3. BoUobas, Cooper, Fenner and Frieze [7] showed that if we condition on Gn,m 
having minimum degree 3 then m — 128n random edges suffice to give a Hamilton 
cycle whp. We believe that this result holds so long as the average degree is at least 
3: 

Conjecture. For any c > 3/2 the random graph Gn,cn conditioned on minimum 
degree 3 has a Hamilton cycle with high probability. 

Another well-studied sparse random graph is the random regular graph. Let Or 
denote a graph chosen uniformly at random from the set of r-regular graphs with 
vertex set [n]. Robinson and Wormald [17] showed Or is Hamiltonian whp for r > 3, 
r constant. Allowing r to grow with n presented some challenges, but they have now 
been resolved (see Cooper, Frieze and Reed [9] and Krivelevich, Sudakov, Vu and 
Wormald [15]). 

We now come to the topic of this paper: Gm-out- We begin with vertex set V = [n]. 
Each V & V independently chooses m random out-neighbors to create the random 
digraph Dm-out- We then obtain Gm-out by ignoring orientation. Note that Gm-out is 
a graph with minimum degree m and average degree 2m. Note further that there is 
the potential for multiple edges, but the expected number is O(m^). So, we can either 
allow these multiple edges or condition on them not occurring. Since the probability 
that there are no multiple edges is bounded away from zero, any property that holds 
whp in the model that allows multiple edges also holds whp when we condition on 
no multiple edges. The Hamiltonicity of Gm-out was first discussed by Fenner and 
Frieze [11]. They showed that G23-out is Hamiltonian whp. This was improved to 
Gio-out by Frieze [12] and to G^^out by Frieze and Luczak [13]. Cooper and Frieze [8] 
showed that the digraph -D2-in,2-out (here each vertex chooses 2 out-neighbors and 2 
in-neighbors uniformly at random) has a directed Hamilton cycle whp. This imphes 
that G4-out is Hamiltonian whp. So the main open question is whether Gs-out is 
Hamiltonian whp or not (note that, whp G2-out contains 3 vertices of degree 2 that 
have a common neighbor and is therefore not Hamiltonian whp). We settle this 
question: 
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Theorem 1. 



lim Pr(G3_out is Hamiltonian) — 1. 



We give a proof of Theorem 1 in Section 2. The proof hinges on a few key Lemmas, 
the proofs of which are given in later sections. 



The proof has two main steps. Before the first step, we set aside a smaU set of edges 
that will not be used until the second step. In the first step we find a collection of 
vertex-disjoint paths and cycles that covers the vertex set. The set-aside edges are 
then used in the second step to transform this collection of cycles and paths into a 
Hamilton cycle. 

Of course, we need some notation. For a digraph D = {V, A) and v ^ V let 
d~^{v), dj){v) denote the out-degrcc and in-dcgrcc, respectively, of v in D. Let V = [n] 
and let fl denote the set of digraphs with vertex set V in which d^{v) = 3 for v E V . 
It will be convenient to let the out- arcs for each vertex be chosen with replacement 
and loops allowed. Thus \Q\ = n^" and D^_out is sampled uniformly from Q. For a 
digraph D let To be the underlying graph obtained by ignoring orientation. Thus 

G^~out = ^ Da-out ■ 

We separate the edge set as follows. We begin by randomly selecting a set of vertices 
K and setting aside the third out-arc of each vertex in K. Formally, for K C V define 
Qk to be the set of digraphs D for which d'^{v) = 2 for all v E K and d'j^{v) = 3 for 
allveV\K. Thus, = n^i^-''^+^'' = 71^"-*=. Further, set 



We work with the uniform distribution on Qk x [n]^- The first component gives all 
but k of the arcs in D^-out and the second component determines the heads of the 
remaining k arcs. For D E we let 



We add the third out-arc of each vertex v G L[D) back into the graph (thereby 
arriving at graph with minimum degree 3). The resulting digraph is used for the 
first part of the proof. The remaining set-aside arcs (i.e. the third out-arc from each 
V e K\L) are used for the second part of the argument. 



2 Proof of Theorem 1 



K6(W) 



K{D) 
L{D) 



{veV : d+{v) = 2} 
{v e K{D) : d-M - 0}. 
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Set 



Let Di = {V, Ai) denote the digraph drawn uniformly at random from Qk- Let A be 
the set of arcs then determined by the randomly chosen fc-tuple in [n]''. Let A2 be 
the set of arcs in A with tails in L{Di) and set = ^4 \ ^42. Let be the set of 
edges given by ignoring the orientations of the arcs in A3. Set D2 = Di + A2 and 
G2 = ^D2- III first part of the argument we find a collection of paths and cycles 
in G2 that covers the vertex set. The edges in are then used in the second part 
of the argument to convert this collection of paths and cycles into a Hamilton cycle. 
Since G2 is a super-graph of G2-out we have the following: 

Lemma 2 (Fenner, Frieze [10]). G2 is connected whp. 

Define a simple 2-matching in a graph if to be a subgraph of H with maximum 
degree 2, i.e. a collection of vertex-disjoint paths and cycles. We assume that the 
vertex set of a simple 2-matching in H is V{H). So some paths in a 2-matching could 
be isolated vertices. If F is a 2-matching let k{E) be the number of components in F 
and let p{F) be the number of path components in F. If F is a simple 2-matching in 
H and x e V{H) then we define F^ to the component in F that contains the vertex 
X. Finally, if F is a simple 2-matching, wc let V{F) denote the set of vertices covered 
by the path components of F. The first step in the proof is to establish that G2 has 
a simple 2-matching with few components. 

Lemma 3. Whp G2 contains a simple 2-matching F2 with ^(^2) < j^^^- 

Lemma 3 is proved in Section 3. The proof uses an extension of the Tutte l-factor 
theorem. 

The Extension- Rotation Procedure defined in Figure 1 converts F2 into a Hamil- 
ton cycle. This procedure takes as input a fixed graph G2 — {[n\,E), the set of arcs 
A-i, (which can also be viewed as fixed) and a 2-matching F — F2 in G2, and, if 
successful, it produces a Hamilton cycle. 

For the purposes of analysis we assume that, while G2 is given, K\L and the arcs 
A3 are not specified ab initio. At each stage there is a set of possibilities for the parts 
ol K\L and A3 that have not been revealed. The algorithm "sees" G2 but does not 
know the orientations of the edges in D2. It "learns" about K\L and A3 as needed. 
More precisely, wc condition on G2 and there is a probability distribution for K\L. 
Information needed to answer the question "is x E K\L" is revealed as the algorithm 
proceeds. Similarly, the random arcs in A3 are revealed one at a time as necessary as 
the algorithm proceeds (note that the heads of these arcs are uniformly distributed 
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Figure 1: Extension Rotation Procedure 

While k{F)+p{F) > 1: 

If V{F) = 0: 

Since G2 is connected, there exists an edge {x, y} E E such that 
and F^ are distinct cycle components. 

Replace F^ and with a single path using the edge {x, y}. 

If ViF) ^ 0: 

Let P be a longest path in F. 

Let X be the set of vertices spanned by P. 

Let END be the set of end-points of paths that span X. 

If there exists y} E E such that x G END and y ^ 'P{F): 

Replace P and F^ with a single path. 

If there exists {x, y} e E such that x e END and y e V{F) \ X: 

Replace P and F^ with two paths by extending a path 

on X ending with x along F^ to one of the endpoints of F^. 

If N{x) C X for all x e END: 

Choose an ordering of the elements of END uniformly at random. 

Consider the elements of END one at a time. For each such x e END: 

Let END(x) be the set of endpoints of paths spanning X 
such that X is the other endpoint. 

Check whether or not x e K\L. 

li X & K\L determine the arc {x, y^) £ ^3. 

II X & K\L and yx £ END(x) then use {x, y^} to replace P with 
a single cycle, add {x, yx\ to E, and move onto the next 
iteration of the procedure. 

11 X ^ K \ L OY yx ^ END(x) then move onto the next element of END. 

llx ^ K\L OY yx^ END(x) for all x E END then FaiL 
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on [n]). Thus we follow the "principle of deferred decisions" in a way made precise 
below. 

Appealing to Lemmas 2 and 3, we assume G2 is connected and ^(-^2) ^ Qn/logn. In 
the course of the procedure wc itcratively modify the 2-matching F and occasionally 
add edges from to the edge set E. In each iteration we attempt to reduce k{F), 
extend the longest path in F while keeping k,{F) and p{F) fixed, or replace a path 
component in F with a cycle component (thereby reducing p{F)). We use edges from 
E whenever possible. We resort to the arcs in A3 only when we cannot perform any 
of the operations using only the edges in E and only to replace a path component 
with a cycle component. Since p{F) only increases in situations where p{F) = 0, and 
in this case we increase p{F) to 1 and simultaneously reduce k(F), it follows that the 
number of arcs from A3 that we use is bounded by k{F) + p{F), which we assume is 
at most 12n/ logn. 

In order to analyze this procedure, we condition. For a graph G we set 

n'i ^{Denk:Tn = G}. 

We prove that the Extension-Rotation Procedure succeeds whp, and thereby prove 
Theorem 1, by working with the partition JF of fi^ x [n]'^ in which {Di, A) and {D[, A') 
are in the same part if = F/)/ and T^^ = Tjy^ (in other words, each part is a 
subset of a set of the form fl^ x [n]'' for some graph G). One virtue of this partition 
is that if we condition on {Di,A) e x [n]'' then K \ L is not determined, it can 
be viewed as a random set (L, on the other hand, is determined). Of course, the set 
i^' \ L is not a uniform random set, but we will see below that it is approximately 
uniform. Another virtue is that the progress of the Extension-Rotation procedure 
depends on as opposed to D2 and so it is identical for all digraphs in the same 
part of the partition, up until the first time we check a vertex for membership in 
K\L. This check leads to a refinement of the partition. 

We define a filtration T — Tq^T\^ . . . of Qjt x \p}^ . is the trivial partition with 
one part that is equal to Vlk x [nY . The partition Ti is determined by T^^ and the 
third out-arcs of the vertices in L: {Di, A) and {D[, A') are in the same part of J-'i if 
F/)j = Fx)^ and the arcs in A with tails in L{Di) = L{D[) are identical to the arcs 
in A' with tails in L{D[) = L[Di). The remaining partitions are then determined 
by the execution of the Extension-Rotation Procedure. We refine the partition each 
time we consider an element of END. Thus, each time we arrive at a 2-factor F such 
that V{F) 7^ and N{x) C X for all x G END, where X is the vertex set of a longest 
path in F and END is the set of endpoints of paths spanning X, we will likely refine 
the partition many times. This is because we will likely have to check many elements 
of END before we find x e END such that x e X \ L and e END(x). Let Cj be 
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the first i vertices checked in the Extension-Rotation Procedure. Further, define 

Ii = Cin{K\ L) and Oi = LU{Ci\K). 

In words, /j are the vertices among the first i checked that are in K \ L and Oj 
is L together with the vertices among the first i checked that are out of K \ L. 
If ordered pairs {Di,A) and {D[,A') are in the same part of J-'i then F^j^ = F^j', 
Ii{{Di, A)) = Ii{{D[, A')), Oi{{Di, A)) = Oi{{D[, A')) and the arcs in A with tails in 
li are identical to arcs in A' with tails in 7j. 

If A" is a part in J^i then Vi{X) = {Di e Qk ■ 3^ e N'' such that {Di,A) e X}. 
Note that if {Di,A) is chosen uniformly at random from X then Di will be chosen 
uniformly at random from 'Di(A'). 

We show that throughout the algorithm nearly every part X of the partition J-'i has 
the property that nearly all the digraphs in X satisfy certain structural conditions 
(i.e.. small maximum degree, no small dense subgraphs, etc.). For X a part of the 
partition J^i we define Fx to be the underlying graph determined by X (i.e. F£)j+/i2 
together with the edges from E3 that have been added by the extension-rotation 
procedure) and Ax to be the out-arcs from A^ determined by X. Furthermore, 
define Cx, Ix and Ox to be the sets of vertices checked for membership in i^' \ L, 
determined to be in i^T \ L, and determined not to be in i^T \ L, respectively, en route 
to the part X. We discard A* if F;^;' has does not have the desired structural properties 
or if too many digraphs D e T>i{X) do not have the desired structural properties. 
In Sections 4 and 5, respectively, we formally define two events: Eg a collection of 
graphs and Ed a collection of digraphs, see (14) and (22) respectively. We will prove 
that if {Di,A) is chosen uniformly at random from Vtk x [nY then 

Pr(3S C A3 such that Td,+a^+b e Eg) = o{l) (1) 
Pr(L>i eEn ^ Tn, ^ Eq) < e''"^"^. (2) 

At stage i of the algorithm we discard each part X of jFj such that F;^' ^ Eg- Wc also 
discard X if \Vi{X)f\ED\ > n2e-'°s'"|D^(A')|. We bound the probability that digraph 
D is discarded at stage i by applying the following version of Markov's inequality. If 
E is an event and ^ is a partition of our probability space then 

Pr[Pv[E\g] > £] = Pv[E[l£\g] >e]< -E[E[l£\g]] = ^ (3) 

£ £ £ 

Applying (3) with e = n^e^^°^^" we see that the probability that there exists an index 
i such that our randomly chosen {Di, A) is discarded at stage i due to (2) is bounded 
above by 1/n. 
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So, we restrict our attention to parts X of the partition jFj in the filtration given 
by the Extension- Rotation algorithm such that ^ £g ^-iid the probability Di is 
in Sd where Di is chosen uniformly at random from is at most n^e^^°^ 

The remainder of our analysis hinges on the following two Lemmas. Of course, we 
must only show that with high probability the Extension Rotation Procedure does 
not arrive in a situation were the longest path in the current 2-factor cannot be 
completed to a cycle using the arcs in A3. To this end, we first show that there are 
many end-points of paths that span the vertex set spanned by the longest path in the 
2-matching. 

Lemma 4. Let G be a fixed graph on n vertices such that S{G) > 3. Suppose further 
that P is a maximal path in G, x is an endpoint of P, X is the set of vertices spanned 
by P and END(a;) is the set of end-points (other than x) of paths that span X and 
have X as one end-point. Then we have 

n 

G^Sg ^ END(a;) > 



100 



Lemma 4 is proved in Section 4. One consequence of Lemma 4 is that each time we 
find X G END such that x & K\L then the probability that the third edge out of x 
completes the cycle is bounded away from zero. Thus, whp the number of cycles we 
close among the first j elements of K \ L found is at least j/200 for, say, j > i/n. 
Therefore, the number of elements oi K\L found before the successful termination of 
the Extension- Rotation Procedure is whp at most 1200n/logn. It remains to show 
that each x G END we consider (that is in neither Ix nor Ox) has a good chance of 
being in K\L. 

Lemma 5. Suppose X is a part of the partition Ti such that Fx ^ So and 

\£DnVi{X)\ <nV'°s'"|i?,(A')|. 

Suppose further that \Ix\ < and \0x\ = o{n). If D is chosen uniformly at 

random from Vi^X) then 



xeV: Pr(x G K{D)) < - ^ 



n 4000 (log log n) 



o{n). 



Lemma 5 is proved in Section 5. It follows from this Lemma that whp the number 
of elements oi K\L found among the first i vertices queried is at least i/(log^''^ n ■ 
8000(loglogn)^) for, say, i > ^Jn. Indeed, as we explore END we can ignore the o(n) 
vertices in Ix ^ Ox and the probability a queried vertex is in \ L is at least 

2 1 



3 4000 log^/2n(log log n)^ 
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as the probability that the queried vertex is among the 'bad' vertices described in 
Lemma 5 is at most 1/3. Thus, the number of elements of K \ L identifies among 
the first i vertices queried dominates i?m(i, l/(6000(loglogn)^ log^^^ n)). Let N be 
the number of vertices queried for membership in \ L before completion of the 
Extension-Rotation Procedure. Assuming the rare events discussed above do not 
occur, we have 

^ 1200n 

log^/^n • 8000(loglogn)3 ~ logn ' 

and so 

N = Oin 



logV^n 

Therefore, whp we have \Ox\ — o{n) throughout the execution of Extension- Rotation 
(inductively justifying our application of Lemma 5). 



3 A simple 2-matching with few components 

We apply the following consequence of the Tutte-Berge matching formula (see Theo- 
rem 30.7 in Shrijver [18]). If G = {V^E) is a graph and A. B are disjoint subsets of 
V then we let e(A, B) denote the number of edges in E that intersect both A and B 
and e{A) denote the number of edges within A. The size of a simple 2-matching F 
is the number of edges in F. 

Theorem 6. Let G = {V, E) he a graph. The maximum size of a simple 2-matching 
in G is equal to the minimum value of 

\v\ + \u\-\s\ + Y. 

X 

where U and S are disjoint subsets of V, with S an independent set, and where X 
ranges over the components of G — U — S. 

We make extensive use of the following observation: 

S C (^"^^ ^ Pr{S C E{Gs.out)) < . (5) 

This is easily seen to be true for jS"! = 1 and after this, knowing that S' C E{G^-out) 
only reduces the probability that e ^ 5" is also in E{G^-out)- 

We define £^cycics to be the collection of graphs on vertex set [n] for which there are 
at least n'^/^ vertex sets C such that \C\ < G[C] is connected and G[C] has at 
least \C\ edges. 



e{X, S) 



(4) 
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Lemma 7. With high probability G2 ^ ^cycles- 

Proof. Wc compute an upper bound on the expected number of small complex com- 
ponents in r^) where D is drawn uniformly at random from Q: 

log n log n 

c=3 ^ ' \ / \ / c=3 

The Lemma then follows from Markov's inequality. □ 
Lemma 8. With high probability a{G2) < 0.415n. 

Proof. Let Z be the size of the largest independent set in Gi = F^i^. Applying the 
union bound (and noting that a maximum independent set is also a dominating set) 
we have 



I J \ n 



'3i / • \ 3 

1-11- — ) 1-- 

n — I J \ n 



We checked numerically that the function 

(1 _ xfx (1 _ (1 _ ,^)3g-3x/(l-x)^|l-^ 



n—i—k 



X^(l — X 



1 1 — x 



is strictly less than one for x > 0.415. We conclude that Z < 0.415n with high 
probability. As Gi is a subgraph of G2, we have the desired result. □ 

Rather than work with the event implicit in Theorem 6 directly, we consider a weaker 
event. Let Q be the event that there exist disjoint vertex sets U, R such that 

(i) V\{ULiR) can be partitioned into sets 5", T such that G[T] is a forest and G[S] 
is an independent set. 

(ii) > max{|C/|,2n/(logn)}, 

(iii) e{V\{UUR)) + e{R,V\{UUR)) = e{SUT)+e{R, SUT) < \V\ + \S\-3\U\-\R\. 

We now argue that G ^ Q U ^cycles implies that G has a simple 2-matching with at 
most 6n/logn components. Suppose G ^ Q U £^cycics- Let F be a simple 2-matching 
of maximum size. Since G ^ ^cycles; F has at most 4n/logn cycle components, and 
therefore the number of components in F is at most An/ logn plus n minus the size of 
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F. So, it suffices to show that G has a simple 2-matching with at least n — 2n/ \ogn 
edges. Let U, S be an arbitrary pair of disjoint subsets of V where S is an independent 
set. We may assume l^l > max {2n/ logn, |?7|} (otherwise the expression in (4) is 
clearly at least n — 2n/ logn). For the given U and S, let R be the set of vertices in 
non-tree components in G2[V \ {U U S)]. Set T ^ V \ {U Li S Li R). Note that the 
graph induced on T is a forest. Suppose this is a forest with p components. Now, 
since G ^ ^cycles implies that there are at most An/ logn components in G[R], we have 



E 



eiX,S) 



> 



e{S,T) + e{S,R)-p-^ 



On the other hand, G implies that condition (iii) does not hold and we have 

e{S, T) + e(5, R)>\V\ + \S\-^\U\-\R\- e(T) 

= \V\ + \S\-^\U\ - \R\ - \T\ +p 
= 2\S\-2\U\+p. 



Therefore, 



E 



X ^ 



e{X,S) 



> \S\ - \U\ - 



2n 
logn 



Applying Theorem 6, we see that there exists a simple 2-matching F in G with at at 
least n ~ 2n/ logn edges. 

It remains to show that G2 ^ G with high probability. We apply the first moment 
method. Throughout these computations we take if to be a fixed set (the set L, 
however, is not determined). For ease of notation we set = \U\,r — \R\,s — \S\, 
etc. We consider 3 cases based on the value olu/n. 

Case 1. u< 

Here we take special care with the vertices in 5" U T that are in K. For disjoint sets 
[/, Z, W such that U,Z <ZV and 1^ C if let Qu,z,w be the event that conditions (i), 
(ii) and (iii) are satisfied with 

SVJT^ZVJW {S\JT)r\L^ Zr\K {S\JT)r\{K\L)^W. 

In words, Z is the set of vertices in 5 U T of out-degree 3 in Z^i + A2 and W is the 
set of vertices in 5 U T of out-degree 2 in Di + A2. Note that this event includes 
conditions on arcs that point at the vertices in Z n K and W; in particular, there is 
at least one arc directed at each vertex in W. Let a be the number of arcs directed 
from Z LiW — S LiT to S LiTLi R and let P be the number of edges that are directed 
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from Z U W = S U T to U. Finally, let 7 be the number of arcs directed from 
R^V\{UiJ SUT) into vertices in W. 

Before proceeding with the calculation, we establish an upper bound on a (and hence 
a lower bound on /?). Note that, as we assume condition (iii) is satisfied, we have 

a + 7 < e{S UT) + e{R,S Li T) < \V\ + \S\ - 3|C/| - \R\ = 2s - 2u + t. 
Therefore 

a < 2s - 214 + t - 7 and /3 > s + 2t - w + 2ii + 7. 
The latter follows from a-\-l3 — ?>s-\-?>t — w. 
For fixed C/, Z, W we have 

Pr(W) ^Yly ^ ){d [—) (6) 

Note that the last term in (6) accounts for the condition that there must be an arc 
directed into each vertex in W: There are at least — 7 vertices in W that are not 

'covered' by vertices in i?, and each of these vertices must be covered by one of the 
arcs out of U or one of the a arcs out of 5" U T that point to vertices in S* U T U /?. 

We bound the probability that there exists a pair [/, R satisfying the conditions of 
the event Q with \U\ < n/e^^ by summing over all possible choices of U, Z and W: 



-(uw).i:(:)(:)a) 



\U,Z,W / u,z,w 



9^223(s+t) 



n/ \ u 

s+t—w 



w 

2S+t 



u.z.w ^ ' ^ 



2 J \n/ \ u 

Ibe-u \ I s + t 



s + t J \ u 



Note that to get the final summand we dropped a factor {u/nY and we used the 
bound 



S + t \ I S + t 



s + t — w J \ w 
12 



< 2 



s+t 



(This can be viewed as a special case of equation (20), which is used extensively in 
Section 4.) We bound the final summand in (7) by considering two subcases. Recall 
that we assume s > 2n/ \ogn. If s + 1 > 32e-u then we write 



s + t, 



If w is a positive proportion of s + 1 then the term suffices to establish the desired 
upper bound, and otherwise the (1/2)*+*^"' term suffices. If, on the other hand, 
s + t< 32eu then (g)"' (^)"' < 1 and we write 



\ n / \s + t J V 
and again we have the desired upper bound. 
Case 2. n/e^^ <u< O.OlSn. 

Suppose first that s + t = n — u — r> 8eu. As above, we let f3 be the number of 
arcs directed from S U T to U. We have f3 > s + 2t + 2u — o{n). The probability 
that there exist U,R that satisfy condition (iii) with n — \U\ — \R\ > 8e\U\ and 
n/e^^ < \U\ < O.OlSn is bounded above by (we write n — u — r — s + t) 



^ \nJ \ n J \s + t 



u.r 



Note that the (^)" term is exponentially small and dominates the e"^"^ term and the 
other terms are at most 1. 

Suppose now that s + t = n — u — r < 8eu. Here we add the consideration of the edges 
from i? to U T to the previous bound. Let 7 be the number of arcs directed from R 
to S UT. Now we have (3>s + 2t + 2u + 'y — o(n). The probability that there exist 
U, R that satisfy condition (iii) with n-\U\-\R\< 8e\U\ and n/e^^ < < O.OlSn 
is bounded above by 

.&(:)(. :,)("V")o'C)(¥)'(-^'-"' 

'^•'••EO'(^,)"'-Q"""' " 

\ 7 J \ n J \ n 
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We write 



"^^^ " n \ (l_ 1±I ] < e3'-^Tre-'^(3r-7) < g-^(3n-6u-3t-3s-7)_ 



7 / \ ^ 
Since < a < 2s — 2u + t — we have 7 < 2s — 2ti + t and 

6ii + 3i + 3s + 7 < 4ii + 4i + 5s < (4 + 5 • Se)u < 1.99n. 

So, we can bound the expression in (8) by 



u,r ^ ' ^ ' «,r ^ ' 

<e°(")5:(^)"e"(f) 

u,r 



As 0.015 ■ e ■ e*^/*^ < 1, we have the desired bound. 
Case 3. 0.015n < u. 

Let U,R (and S'UT = V\{UUR)) be fixed disjoint sets such that \U\ > 0.015n. Let 
a be the number of arcs directed from 5'UTtOiS'UTUi? and ( be the number of 
arcs directed from R to S UT. The probabihty that there exist such sets U, R with 
the specified number of arcs is bounded above by 

y~v — U\ /3{n — U — r)\ W ^M^3(n— u-r-)-a 



o \ / , \ C / , \ 3r— C 

or\ ( u + r \ I u + r ^ 



u,r,a,C 

As we are bounding the probabihty of the event Q, we assume (in particular) that 
condition (iii) holds; that is, we assume, 

a + C = e{SUT) + e{SUT,R) <n + s-3u-r, 

which implies 

a<n + s-3M-r-C = 2s-2M + t- C<2s + t-2M. (10) 
Now the expression 

3{s + t)\f^_uy^uy<^ (H) 



a / \ n/ \n 
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is increasing for a < 3(1 — u/n){s -\-t) — u/n. Given (10) and u < s < a{G'2) < 0.5n 
(by Lemma 8) we have 

3{1 -u/n){s + t) -u/n- {2s + t-2u) = {1 - 3u/n)s + {2 - 3u/n)t + 2u - u/n 

> {1 — 3u/n)s + 2u — u/n 

> 0. 

Thus in our range of interest, the expression (11) is increasing in a. 

Therefore, the probabihty that there exist disjoint sets U, R with \U\ — u and \R\ — r 
that satisfy conditions (i), (ii) and (iii) is bounded above by 



o{n)y^ fAfn-u\/ 3{n-u-r) \/^_uy 
^ V«/ V r J\n + s-3u-r-Cj\ n) 



qi \ n+s—3u—i — C / n \ 2n—s—2r+C—o(n) 



where we set s = min{n — u — r, 0.415n}. Note that the only role s plays in (9) is in 
the upper bound on a. So, we take the maximum possible value of a by taking the 
maximum value for s (and this has no impact on the rest of the expression). Also 
note that we apply Lemma 8 to set an absolute upper bound on the s. 

We turn now to calculations. We re-name u — > xn, r — > yn, 7 — > zn. Then for a> b, 
we let Bin(a, b) = be a replacement for a corresponding binomial. We now 

define the function of three real variables 

g{x, y, z) = Bin(l, x) ■ Bin(l — x,y) ■ Bin(3(l — x — y),a) ■ Bm{3y, z)- 
where 

ot{x, y,z) = l + s{x, y,z) -3x-y - z p{x, y,z) = 2- s{x, y,z)-2y + z 

s{x, y, z) = min{l — x — y, 0.415}. 

Note that y < 1 — 2a; is equivalent to u < s + 1. 

Claim 9. // 0.015 < x < 0.415, < y < 1 - 2x and < z < 3y then g{x, y, z) < 
0.995. 

Note that Claim 9 implies that the expression in equation (12) is exponentially small 
for all relevant choices of r, m, C, and Lemma 3 follows. 
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It remains to prove Claim 9. Of course, this can be achieved by a direct assauh using 
calculus (e.g. concavity is useful). However, the authors feel that a simple check by 
computer is more compelling. Posted at www.math.cmu.edu/~tbohman the reader 
will find a short program in C that divides the 3-dimensional region into small cubes 
(side length 0.00125) and bounds the function over each cube, thereby bounding the 
function over the entire region. The numerical bounds on the expression in (13) are 
achieved by bounding each term in (13) individually over each sub-cube in a trivial 
manner. Problems with numerical rounding are avoided 



4 Rotations: Proof of Lemma 4 

We begin by defining the event Eq- This event is comprised of three properties: large 
maximum degree, small dense sets and a more complicated non-monotone condition 
for larger sets. Set 

^ ^ 21ogn 
log log n 

The event 

^small U ^pieces 

(14) 

where £^smaii is the event that there is a vertex set S and an integer i such that 
l<i<10, |S'|<| \ogn and the graph induced by S is connected and spans at least 
15*1 + i edges. £^picccs is defined immediately following the proof of Lemma 10 below. 

The events A{G) > Aq and 6'smaii are increasing and so we can bound the probability 
that there exists B C. A3 such that rGi+A2+s is in one of these events by working 
with Q itself. 

Lemma 10. If D is chosen uniformly at random from fl then 

Pr(A(rc) > Ao V Pc e Ssmaii) = o{n-'/^). 
Proof. The degree of a vertex in P^i is distributed as 3 -|- Bin(3n, 1/n). Therefore 

Pr(A(P^) > Ao) <n(^^^''_ 3);^ = -(^"^Z^)- 
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Another application of the union bound gives 

10 

1 s 



4- log" 

Pr(rc e ^small) 



s=4 



1+ 



E/ ne D 
V s ^ n 



s=A 



\ log' 



3£^ 



■ log 71 



■ log n 



= on 



-l/2x 



□ 



Let A^(5') — {y ^ S : 3x E S such that y is adjacent to x} be the set of neighbors of 
a vertex set S. Define £^pieces to be the event that there exist sets Si,S2,T-i,T2 such 
that 

(i) Si and ^2 are disjoint with n^'^ < \Si \ + \S2\ < 

(ii) For each x E Si there exists Ux E SiVJ Ti such that {x, Ux] is an edge, with the 
additional property that if m, a; G 5*1 and u = yx then x 7^ y^- 

(iii) For each x & S2 there exist distinct yx,Zx G T2 such that {x,yx}, {x, Zx} are 
edges. 

(iv) Ti ^ {yx : X e Si and y^; ^ 5"!} and T2 = {y^;, Zx : x e S2}. 

(v) Ar(S'i), 7V(S'2) C Ti U Ts and ^'2 is an independent set. 

Lemma 11. Let {Di,A) be chosen uniformly at random from Qk x [n]^. 

Pr(3B C ^3 : Td.+a.+b e £:pieces) < e-^("^'"). 

The proof of Lemma 11 is given in Section 4.1 below. Note that Lemmas 10 and 11 
imply (1). We now proceed with the proof of Lemma 4. 

Let P he X = xo,Xi,X2, . . . and S = END(a;). We show 15*1 is large in two stages. 
Stage 1: \S\ > n^/^o. 

Let Tj be the set of end-points (other than Xq) of paths that can be obtained from P 
by i rotations with xq fixed. We view the Tj's as the level sets in a tree rooted at Xq. 
Note that the children of a vertex y e Ti in this tree are second-neighbors of y in G. 
Since each end-point y E Ti yields d{y) — 1 end-points in Tj+i and S{G) > 3 we have 

|T,+i| >2|T,|-5,, (15) 
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where 5i accounts for vertices in Tj that have second-neighbors in common. 

If for some £, p we have Yli=i ^i — P then there exists a connected subgraph H with 
V < Mp vertices and at least v + p — 1 edges. Since G ^ ^^smaib it follows that p < 10 
implies 

1 IX 

4:£p>v> log n =^ £> log n. 

4 p 16 

Therefore, 

(logn)/18 
i=l 

and 

|^|>|T(logn)/18|>2^-^>nV30 

for n sufficiently large. 
Stage 2: \S\ > ^. 

Suppose that the vertices in S induce a collection of sub-paths Pi, P2, • • • , Pm of P 
and that Pi contains Sj vertices. Let Y be the collection of paths Pi that consist of a 
single vertex. (So, if Sj = 1 then Pj e Y. Note that for these paths we abuse notation 
by referring to a single vertex as a path.) Posa [16] proves that 

Xi e N{S) ^ Xi-i E S or Xi+i e S. (16) 

Note that (16) implies 

ueY and veS ^ {u,v} ^ E{G). (17) 

Set 

5*2 = Y, T2 = {xi : Xi_i e ^2 or Xi+i e S2} 

Si = S\Y, Ti = {xi : Xi ^ S and {xi-i e Si or Xi+i e Si)}. 

In words, S2 is the set of vertices that are singleton paths in the collection Pi, ... , Pm 
and ^i is the set of vertices in longer paths. The sets T2 and Ti are the vertices 
adjacent along P to the end- vertices of the paths in Pi, . . . , P^. 5*1, 5*2, Ti and T2 
satisfy conditions (ii), (iii), (iv) and (v) in the definition of the event ^^pieces and 
G ^ i^picccs- Here for x e S2, Vxi^x a-re its two neighbors on P. For each non-singleton 
path Pi the vertex x G Pi closest to xq along P lets be its neighbor along P closer 
to Xq and all other v E Pi let be the neighbor along P in the direction away from 
xq. It follows that 1^1 = \Si\ + \S2\ > 
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4.1 Proof of Lemma 11 

Wc apply the first moment method. 
Let s = \Si\ + 15*21 and u = \S2\- 

We view K as a fixed set. The set L, on the other hand, will be random. We then let 
K' G K he the tails of the arcs in A\{A2UB). So K' is the set of vertices (for a given 
choice of B) for which the third out-arc is still not in the graph. For X C. K and 
sets Si,S2,Ti,T2 C [n] let £x,SuS2,n,T2 be the event that K' ^ X and -Si, ,^2, Ti, Ta 
satisfy conditions (i)-(v). We bound the probability of the union of all events of this 
form using the first moment method. A key property of this event is that, since we 
set X = K', each vertex in X has in-degree at least 1. 

We do not work with the events Sx,Si,S2,Ti,T2 directly. Rather we consider the events 
^Y,Si,S2,Ti,T2 where Y C (5*1 U 5*2) H K defined by 

^Y,Si,S2,Ti,T2 = V ^X,5i,52,Ti,T2- 

XQK: 
Xr\{Si\JS2)=Y 

In words, we consider the event where we specify membership in K' only for those 
elements of K that are in U 82- We choose F, 5*1, 5*2 using the following method 
(in fact, we break 5*2 into more parts in the actual calculation below, we indicate 
here how one can recover the set Y from the expression given below). We choose 
Q,RQ [n] and Q,RCK. Then we set 

Si = QuQ S2 = RUR Y ^QUR. 

The important point to note here is that if a; G {RUQ)r\K then we are considering the 
event where x ^ K'. So the third arc out of x is present in the graph. Furthermore, 
as we specify Q U i? C K', there is at least one arc pointing into each vertex in QUR. 

Now we explain the further division S2. We write u = + + 1^2 + Oq + ui + 1)2 where 
Vi is the number of vertices x e 5*2 \ -ftT' for which i of the arcs between x and {yx, z^} 
are directed into x and i>j is the number of vertices x E S2r\K' for which i of the arcs 
between x and {y^, z^} are directed into x. Let p— \Si\K'\ and p— \Sir\ K'\. The 
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probability that is in ^^pieces is at most 




Explanation of (18) View the summand as five terms separated by the 
symbol x . The first is an upper bound on the number of choices for Si, S2 
and Y. For such a fixed choice of Si, S2 and Y the rest of the expression 
in (18) is an upper bound on the probability of the union of all events of 
the form ^y,Si,S2,Ti,T2- The second term is an upper bound on the number 
of choices for Ti, T2 and the exphcit edges between Si\JS2 and Si\JTi[JT2 
given by conditions (ii) and (iii). Then we consider the probability that 
the vertices in S2 have the prescribed neighbors. Note that if v is among 
the uq vertices in 5*2 n K' that has out-arcs directed to and Zy then, 
since v E K' and therefore v has in-degree at least 1, there is a vertex in 
Ti U T2 that directs one of its out-arcs to v. The accounts for the 'extra' 
6s /n factor in this term of the product. The fourth term is a bound on the 
probability that the arcs out of are in the prescribed locations. Finally 
we multiply by the probability that no vertex outside of 5*1 U 5*2 U Ti U T2 
sends an arc into U S2. 

The bound in (18) can be approximated from above by 
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We repeatedly apply the following observation (the proof of which is given below) : 
The expression in (19) can therefore be bounded above by 

n/lOO 



gS(-3+9s/n+3fc/n) x X ( 

-„1/10 ^'0.1'l,i^2 ^ 



£(s, A:, n) 



1^0,1^1,1^2 



where 

£{s, k, n) = 24s(s/n) + 12s{k/n) + 6s + 36s{s/n) + 36s(s/n)2 + 6s(A;/n) 

+ 12s(/e/n) + 18s{ks/n^) 
= s (6 + 60s/n + 36sV^^ + ^(l)) • 

Since we assume s < n/100, we have i{s, k, n) < 6.7s and the summand in (19) is at 
most 

< (0.995)^ 

Thus the sum in (18) can be bounded by n*(.995)"^^^°, and this proves Lemma 11. 
Proof of (20). Let f{x) = {^Y (iz^)'"^- Then we have 



= log 



log a — 1 — log X — log l3 + 1 + log{z — x) 
z — x)a' 



(5x 



So, /'(x) = oifrx = x* = ^. 

Differentiating once more, we get 

f"{x) f{xf _ 1 1 
f{x) f{xY X z — x 

and so f"{x*) < and x* is the maximum over x & [0, z]. □ 
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5 Conditioning: Proof of Lemma 5 



We begin with some definitions and initial observations, including the definition of 
the event Sr,- Set 

T = 3 - - — Jyr— = 3 - a ^ = [lOlogr log^l • 

log ' n 

Recall that A* is a fixed part in some partition in the filtration of Vt^ x [nY given by 
the execution of the Extension-Rotation Procedure. Here we work with the space of 
digraphs Q^k] the edges that have been added to raise the minimum degree to 3 or in 
the course of the Extension-Rotation Procedure play no role. 

For X E [n\ \ {Ix U Ox) let Vi[X) be partitioned into V^^V^ where D & ii 
X e K{D). Note that for fixed x we have Pr(a; e K{D)) = \V^\/\Vi{X)\. 

Define a bipartite graph S = S^; on vertex set T>i{X) by including an edge joining 
D G and D' G Vj. if D' is obtained by reversing the arcs along a shortest path of 
length £ in D that starts at a vertex in V \ {K{D) U Ox) and ends at x. We have, 
where ds denotes degree in E, 

J2 d^{D) = Yl MD). (21) 

We get a lower bound on |I>j;|/|Pi(A')| by bounding the degrees in E: We estabhsh 
upper bounds on ds(-D) for D e and lower bounds on d^{D) for D e V^. 

For a digraph D E Qk let IT^i x-^y be the set of vertices y E V such that there 
exists a shortest path of length i from y to x. Note that r is the average out-degree 
in D G Ofe and so ~ log^° n is approximately equal to the expected number of 
vertices reachable by a path of length i. Let IId,x^k be the set of vertices z E K 
such that there is a shortest path from x to z of length £. We bound |n/)^j;<_y| and 
|nj5;j.-.A'| for most D G T>i{X) and x G [n\. To this end, we define the following 
cut-offs: 

log^° n k 

TTin = (1000 log log n) log^° n Tlout = J. — ; ^ ■ - 

(log log n)^ n 

Pin = (logn)2°. 

Let "if^v ^ be the event 

lOn 

\{x G [n] : in^i^^yl > TTin}! > J. TTT^ or 3x G [n] such that \Ud,x<-v\ > Pin- 

(logn)^^" 
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Similarly, define "^-^k Q to be the event 

\{x e [n] : \IId,x^k\ < 7^ont}\ > 

We now define 



(logn) 



100 



£d = U *^y. (22) 

Lemma 12. If D is chosen uniformly at random from Qk then 

Pr{D eSo A Td^ Eg) < e-^'^^'". 

The proof of Lemma 12 is given in Section 5.1. Note that condition (2) follows from 
Lemma 12. We now proceed with the proof of Lemma 5. 

For each D e T>i{X) we define three sets of vertices: 



Vi = Vi{D) = {xe [n] 
V2 = V2iD) = {xe [n] 
V3 = Vs{D) = {xe [n] 



\^D,x^v\ > TTin} 
|nD,,r^i^| < TTout} 
\^D,x^K n /a'I > 7rout/2} 



The key observation is that most x G [n] have the property that x G Vi{D) for very 
few D G Vi{X) for i = 1,2,3. Indeed, the number of ordered pairs {x,D) where 
X G [n], D G Vi{X)\-^^v and x G Vi{D) is at most 10|Di(A')|n/(log^°°n). It follows 
that 



\x G [n] : \{D G V,{X) \^^y:xe V^{D)}\ > ^iMj 
I log n J 



< 



lOn 
logn 



Similarly, we have 



(x G [n] : \{D G V,{Z) : x e V^P)}! > ^^^j 

I log n J 



< 



n 



logn 



(23) 



(24) 



Now we turn to the set of vertices x G [n] with the property that x G V3{D) for many 
digraphs D. Consider a fixed digraph D G \ ^^y. The number of ordered 

pairs (a, b) where a & V, b & Ix and there is a shortest path of length i from a to 6 
is at most 

lOn ^1^1 _ nlog^°n 2 

100— • Pin + ■ ^in < — j (loglogn) 

log n log n 

for n sufficiently large. It follows that |V3(L')| < 2n(loglogn)''/(log^/^ n). Therefore 



\x G [n] : \{D G V^{X) \ : x G V,{D)}\ > ^^^] 
L log ' n ) 



< 



n 



log^/^n' 



(25) 
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To prove Lemma 5 it suffices to show 



xe[n]\ (/;, U O;,) : 



< 



1 



k 



\Vi{X)\ - 4000(loglogn)3 n 



o{n). 



Wc may therefore restrict our attention to vertices x in the compiements of the sets 
in equations (23), (24) and (25). 

Now apply (21). We have 



V log^'^'n log'^nj 2 

DeV^ ^ to 6 / 



Therefore 



'''"out \ 



I TTout 



m (^TTin + ^j >|Pl(A')|^(^l 

and 



2pu 



> 



TTout 



I Pi (A') I - 47rin 4000(loglogn)3 n' 



5.1 Proof of Lemma 12 



We begin with the event ^^y. Let x be a fixed vertex. For i — 1, . . . ,ilet Si = Si{x) 
be the set of vertices that can reach x in a path of length i but not via a shorter path. 
We reveal the sets Si iteratively (i.e. using breadth first search). 

Claim 13. If f{n) is an arbitrary positive real function then 

Pr {\Si\ > f{ny ATn^ Sg) < e^^-ZW/io 

for n sufficiently large. 

Proof Note that Fd ^ Eg implies A{G) < Aq = and so 

j=0 j=0 
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Thus, wc may discard the set of digraphs for which there exists an index 1 < i < i such 
that |<S'j| > s/njl. In other words, we iteratively condition on the event |'S'j| < s/njl. 

Now an observation. If < A < 1 and X]j=o I'^jI — then we have 



£;[e^l^*+il I \S,\\ < 



m=0 



TTl 

m 



n — Wn 



\S,. 



n — \ n 



+ 



n — ^Jn n — ^Jn 



< 1 + A(1 + A) 
V n- ^Jn 

< gT|5i|A(l+A)(l+2/^AI)_ 



Now set Ao = 1/2. For i = 1, . . . , £ we define Aj by setting 

Ai_i = TAj(l + A,)(l + 2/V^). 
Note that so long as Aj > Tjvt}-!'^ then we have 



A,: 



A,; 



A,, 



T \ T ) T 

Then inductively, so long as Aj > r/m}/^^ we have \j < 1/{2t^) and 



A, > 



Ao 
P 



m=0 ^ m=0 ^ ' 



exp 



m=0 



r 



> 



(Note that we use the inequality \ — x> e for a; < 1/2.) In particular, we have 

1 



A^ > 



lOr^ 



(26) 



Wc now define a random variable Kj. If liSil, |iS'2|, .... I^j-il < \/n/i then set 1^ = 
l^jl. If, on the other hand, there exists j < i such that \Sj\ > \fnji then set 
Fj = Aj_iFj_i/Aj. Let i3j be the event that there exists j <i such that \S^\ > \/n/i. 
For i = 1, . . . , £ we have 

E [e^'^' \Bl]=E[E [e^'l^'l | | 



< E 
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Of course, the same inequality holds if we condition instead on B.^. It follows that 
E [e^^^^] < E [e^°^°] = e^/^. Applying Markov's inequahty and (26) we have 

Pr {Ye > /(n)/) < E [e^'^'] g-^^^^")^' < e^-^^^'*). 
As the event \Se\ ^ Y£ is a subset of To € Sq, the claim follows. □ 

Let X be the set of vertices x such that |n£),a:^y| = [^^(a;)! > 1000 log log n ■ log^°n. 
Applying Claim 13 we have 

gl/2 

Pr{xeX AFd^Sg) < r^o^- 

log n 

Therefore, E[\X\ \ Yd ^ Sg] < 2n/log^°°n and E[\X\] < 3n/log^°°n (as Pr{TD G 
Sg) = o(l/v^) and |X| < n) . We bound the probability that X is significantly 
larger than its expected value by applying the Azuma-Hoeffding inequality to the arc- 
exposure martingale on Qk- (This is an instance of the first general setting in Section 
7.4 of Alon and Spencer [2].) We fix K and expose the out-arcs one at a time. Let 
be the i^^ arc in some ordering of the 3n — k out-arcs. Set Xi — E[\X\ \ Oi, 02, ... , Oj]. 
Note that if D and D' differ by a single out-arc then we have 

e 

\\X{D)\-\X{D')\\<Y,^^<log''n. 

i=0 

It follows that l^j+i — Xi\ < log^^ n for i > 0. The Azuma-Hoeffding inequality then 
implies 

To bound the probability that there exists a vertex x such that lll^j^aj^yl > we 
apply Claim 13 with f{n) = log^^n. It follows that Prd-S^l > pin A F^i ^ Sq) < 

g-(logiOn)/10_ 

Now we turn to the event '^-.k- Let y be a fixed vertex. Let Tj = Ti{y) be the 
sum of the out-degrees of the set of vertices that can be reached from y via directed 
paths of length i but not via shorter paths (i.e. Tq = 2 or 3). As above, we reveal Tj 
iteratively. Recall that £ = flO log^ logn] . Set £1 = £ — 1 — log2 (1000 log log n). 

Claim 14. 

Pr(T,_,<r")<'. 



26 



Proof. We begin with an observation. Define 7j = 3*"'"^/(2n). Note that the probabil- 
ity that an out-arc from Tj hits a vertex that has aheady been seen is bounded above 
by 7j. Then 

E [e-^l^^+^l I \n] < (7, + ae-'' + {1 - a - ^i)e-''f'^ . (27) 

We apply (27) in two ways. First, if A > and Ae^"^-^ > 27^ then on replacing the 
first 7j by Ae~^^/2 and dropping the second 7^ we see that 

7i + ae-^^ + {l-a- 7i)e"^^ < e'^^ 

and so 

E[e-^m+i\ I \Ti\] <e-2^l^< 
Furthermore, if 87^ < A then we have 

E[e-m+i\ I \Ti\] < (7, + Q;(l-2A + 3A2) + (l-a-7,)(l-3A + 5A2))'^^' 

< (1 - A(t - 6A))I^*I 

< g-A(T-6A)|Ti|_ 

Set Ao = 1000 log log n and io = i — ii + 3. For i = 1, . . . , ^ — 1 define Aj by 



Aj-i 



2Aj Hi < io 

Xi{T - 6Xi) if i > io 



(with the additional condition Aj < r/12 in the latter case). Of course, we make 
these choices in order to establish £'[e~-^»+il^'+il] < E[e~^'^'^^^] for i = 0, 1 and 

therefore 

E;r -A^_i|T^_i|l < -AolTol] _ ^ 

Applying Markov's inequality we have that for any s > 0, 

Pr\\Te-i\ < ST^~^] < e^^-i^-'-'E[e-^^-il^^-ill < 



- log^°°° n ■ 

We now establish an upper bound on A^_i. Note first that Ajg = ^ and Ajg+i < ^. 
Then observe that i > io and Aj < ^ implies 

A.<%l(l + ?^). (28) 
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Furthermore, we have Aj < \-\/2 for all i and the bound 

pj (^1 + < exp 1^ ^1 < eW- < e^ 

then implies 

Taking s = 7--^-i+^i^ we have 

Pr(|T,_i|<r^^)<— 3^ 



log^™% 



□ 



Now let Y be the set of vertices y such that Ti_i{y) < r^^. It follows from Claim 14 
that we have -E[|y|] < n/log^°'^°n. We again apply the Azuma-Hoeffding inequality 
to the arc exposure martingale on Qk to show that the \Y\ is concentrated around 
its expected value. Recall that denotes the i*^ arc observed in this process. Set 



Y 



E[\Y\ I ai, 02, ... , tti] if Pr (A(rij) > log^ n \ ai, . . . ai_i) < ^ 
Yi^i if 3j < i : Pr (A(rB) > log' n | oi, . . . a,) > ^. 

In order to bound the one step changes in this sequence of random variables, we can 
restrict our attention to the situation where Pr (A(ri3) > log'n | ai, . . . aj_i) < 
First note that if D and D' differ by a single out-arc and A(r£)), A(rD/) < log'n 
then we have 

\\Y{D)\ - \Y{D')\ \ < (log'n)'-^ < n^/^°. 
To deal with pairs D, D' where ^{Td) or A(r£)/) > log' n we note 

Pr {^{Vd) > log' n\ai,...ai) < nPr {^{Vd) > log' n \ ai, . . . a^-i) < 1/n 

and |y| < n. Putting it all together we have \Yi+i - Yi\ < n^^^ + n/n < 2n^/^°. The 
Azuma-Hoeffding inequality then implies 

Pr(Y,„.,>Em+")<e-'"'°-"». 
V log nj 

Noting that 

Fr{Ysn-k + \Y\) < n^Pr{A{TD) > log'n) 



<r? • n 



3n 



log' n — Sy \n 
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we conclude that 



Pr {\Y\ > n/(log^^°n)) < e-(^°s'")/2_ 



(29) 



Suppose now that x ^Y. Then 



^D,x^K dominates Biij^^ ,k/n) — Bi(3^ ,3^ /n). 



(30) 



The first binomial in (30) is the number of edges from Te^i{y) to K and the second 
binomial dominates the number of vertices chosen twice by these edges. The lemma 
now follows from (29), (30) and simple bounds on the binomials: 



Pr{Bi{r^\k/n) < r^'k/2n) < e'^^'"^' ; Pr{Bi{3\3^ /n) > log" n) < e-"('°s^")_ 



Final Remark. We have established that Gs-out is Hamiltonian with high probabil- 
ity. In general, it may be the case if m > 3 then with high probability Gm-out contains 
[m/2j disjoint Hamilton cycles plus, in the case that m is odd, a disjoint matching 
that saturates all but at most one vertex. We leave this as an open problem. 

Acknowledgment. The authors thank the anonymous referees for many useful 
comments. 
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